Abstract

Reanalysis products from MERRA, NCEP2, NCEP1, and ECMWF were used to force an
established ocean biogeochemical model to estimate air-sea carbon fluxes (FCO2) and partial
pressure of carbon dioxide (pCO2) in the global oceans. Global air-sea carbon fluxes and pCO2
were relatively insensitive to the choice of forcing reanalysis. All global FCO2 estimates from
the model forced by the four different reanalyses were within 20% of in situ estimates (MERRA
and NCEP1 were within 7%), and all models exhibited statistically significant positive
correlations with in situ estimates across the 12 major oceanographic basins. Global pCO2
estimates were within 1% of in situ estimates, with ECMWF being the outlier at 0.6%. Basin
correlations were similar to FCO2. There were, however, substantial departures among basin
estimates from the different reanalysis forcings. The high latitudes and tropics had the largest
ranges in estimated fluxes among the reanalyses. Regional pCO2 differences among the
reanalysis forcings were muted relative to the FCO2 results. No individual reanalysis was
uniformly better or worse in the major oceanographic basins. The results provide information on
the characterization of uncertainty in ocean carbon models due to choice of reanalysis forcing.


Keywords: Reanalysis, ocean, carbon, fluxes, models 

1.0 Introduction 

The oceans play a critical role in the global carbon cycle. More than 90% of the active
non-geological carbon pool resides in the oceans (Kaufman, 1998). Estimates of global primary
production suggest that the oceans contribute about half (Field et al., 1998). One quarter (Le
Quéré et al., 2010) of the carbon emitted by anthropogenic sources is thought to be sequestered
in the oceans annually. Understanding the role of the ocean in the global carbon cycle is a
driving question in modern Earth science. It requires foremost a geographically-distributed,
well-maintained observational capability. We are fortunate that such a capability exists or is in
development, and that global data sets of ocean carbon inventories (Key et al., 2004), partial
pressure of CO2 (Takahashi et al., 2006; 2009) and ocean-atmospheric exchange (Takahashi et
al., 2006; 2009) are publicly available.

Global ocean carbon models require external information to drive the ocean circulation
dynamics that determine the distributions, abundances, and atmospheric exchange of carbon.
Additionally, biological and chemical constituents that play important roles in the ocean carbon
cycle are affected by ocean circulation. These forcing fields can be from a coupled atmosphere
model or from atmospheric and ocean data. In the latter case, the data typically come from
publicly available reanalysis products (e.g., Le Quéré et al., 2010; Gorgues et al., 2010; Doney et
al., 2009). It is clear that different ocean models produce different estimates of air-sea fluxes
(Khatiwala et al., 2013), but less effort has been given to the influences of different reanalysis
products. These differences in reanalysis products and their potential effects on simulated ocean
carbon distributions and trends have been cause for concern by ocean modelers (Le Quéré et al.,
2010).

Here we intercompare model air-sea flux estimates and partial pressure of carbon dioxide
(pCO2) from a model forced by four reanalysis products. These include the Modern-Era
Retrospective analysis for Research and Applications (MERRA; Rienecker et al., 2011), two
from the National Centers for Environmental Prediction (NCEP), NCEP2 (Kanamitsu et al., 2002)
and NCEP1 (Kalnay et al., 1996), and one from the European Centre for Medium-range Weather
Forecasts (ECMWF; Dee et al., 2011). This study provides an opportunity to evaluate how the
differences in reanalysis products propagate through the same ocean biogeochemical model to
affect representations of carbon fluxes and pCO2.

This effort is potentially important not only to ocean carbon modelers, but also to reanalysis
developers and analysts, satellite mission conceptual designers, and atmospheric scientists.
The objective of this study is to provide quantitative information on the spatial
distributions of air-sea carbon fluxes and ocean pCO2 globally, regionally, and sub-regionally in
a model forced by the four state-of-the-art, widely used reanalysis products listed above. Such
information can guide scientists and analysts in selecting and using different reanalysis
products, and alert them to potential pitfalls, in the context of ocean carbon models.

2.0 Material and Methods 

2.1 Global Three-Dimensional Circulation Model 

Global ocean carbon dynamics are simulated by the NASA Ocean Biogeochemical Model
(NOBM; Figure 1). It is a three-dimensional representation of coupled circulation/
biogeochemical/radiative processes in the global oceans (Gregg et al., 2003; Gregg and Casey,
2007). It spans the domain from 84°S to 72°N latitude in increments of 1.25° longitude by 2/3°
latitude, including only open ocean areas, where bottom depth is >200 m. The circulation model is
quasi-isopycnal, with 14 vertical layers, driven by the forcing fields shown in Figure 1 (Schopf
and Loughe, 1995). It relaxes to sea surface temperature obtained from MERRA and surface
salinity obtained from the National Oceanographic Data Center (NODC; Conkright et al., 2002).
The biogeochemical processes model contains 4 phytoplankton groups, 4 nutrient groups, a
single herbivore group, and 3 detrital pools. The phytoplankton groups differ in maximum
growth rates, sinking rates, nutrient requirements, and optical properties. The 4 nutrients are
nitrate, regenerated ammonium, silica to regulate diatom growth, and iron. Three detrital pools
provide storage of organic material, sinking, and eventual remineralization.

Carbon cycling involves dissolved organic carbon (DOC) and dissolved inorganic carbon 
(DIC; Figure 2). DOC has sources from phytoplankton, herbivores, and carbon detritus, and a 
sink to DIC. DIC has sources from phytoplankton, herbivores, carbon detritus, and DOC, and is 
allowed to exchange with the atmosphere, which can be either a source or sink. The ecosystem 
sink for DIC is phytoplankton, through photosynthesis. This represents the biological pump 
portion of the carbon dynamics. The solubility pump portion is represented by the interactions 
among temperature, alkalinity (parameterized as a function of salinity), silica, and phosphate 
(parameterized as a function of nitrate). The alkalinity/salinity parameterization utilizes the 
spatial variability of salinity in the model adjusted to mean alkalinity 

$$TA = \underline{TA}\,\frac{S}{\underline{S}}$$

where TA is total alkalinity and S is salinity. The underline represents global mean values.
$\underline{TA}$ is specified as 2310 µEq kg⁻¹ (Ocean Model Intercomparison Project (OCMIP;
www.ipsl.jussieu.fr/OCMIP)) and $\underline{S}$ as 34.8 PSU (global model mean). Since the model
contains nitrate but not phosphate, we estimate phosphate by multiplying nitrate by 0.1. This is derived
from the global mean ratio of nitrate to phosphate from NODC for their top three standard levels.
The calculations for the solubility pump follow the standards set by the Ocean Model
Intercomparison Project (reference above). We recognize that this approximation for alkalinity
is not optimal, but the surface results compare favorably with data (see Gregg et al., 2013). The
difference between the model and GLODAP global surface alkalinity is 2.7 µEq l⁻¹ (= 0.1%)
with basin correlation of 0.95 (P<0.05) (Gregg et al., 2013). We consider this sufficient for the
present purpose of intercomparing model results from forcing by different reanalysis products.
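
The two parameterizations described above (alkalinity scaled by salinity, and phosphate taken as a fixed fraction of nitrate) can be sketched in a few lines. This is only an illustration using the constants quoted in the text; the function and constant names are hypothetical and are not taken from NOBM.

```python
# Illustrative sketch only; names are hypothetical, not NOBM identifiers.
TA_MEAN = 2310.0    # global mean total alkalinity, uEq/kg (value quoted in the text)
S_MEAN = 34.8       # global model mean salinity, PSU (value quoted in the text)
PO4_PER_NO3 = 0.1   # phosphate estimated as 0.1 x nitrate (value quoted in the text)


def alkalinity_from_salinity(salinity):
    """Scale the global mean alkalinity by local salinity: TA = TA_mean * S / S_mean."""
    return TA_MEAN * salinity / S_MEAN


def phosphate_from_nitrate(nitrate):
    """Estimate phosphate from nitrate using the fixed factor given in the text."""
    return PO4_PER_NO3 * nitrate
```

By construction, a grid cell at the mean salinity returns the mean alkalinity, and saltier water returns proportionally higher alkalinity.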

We employ a locally-developed lookup table valid over modern ranges of DIC, salinity,
temperature, and nutrients for computational efficiency, at little cost to accuracy. Air-sea CO2
exchange as a function of wind uses the Wanninkhof (1992) formulation, as is common in global
and regional ocean carbon models (e.g., McKinley et al., 2006). A more complete description of
NOBM can be found in Gregg et al. (2013).

NOBM is spun up for 200 years under climatological forcing from each reanalysis. Initial
conditions for DIC are derived from the Global Data Analysis Project (GLODAP; Key et al.,
2004). DOC initial conditions are set to 0 µM. Subsequent tests with non-zero DOC initial
conditions showed negligible differences. Other initial conditions are described in Gregg and
Casey (2007). For MERRA forcing, the first ten years of the run show a net pCO2 difference
ΔpCO2 (year 10 - year 1) of -0.982 µatm; at 100 years the 10-year ΔpCO2 (year 100 -
year 91) is 0.413 µatm, and at 200 years the 10-year ΔpCO2 (year 200 - year 191) is 0.102 µatm
(Figure 3). This 200-year model spinup may not be sufficient for full adjustment of all variables
at all depths, but appears satisfactory for surface pCO2 and nutrients, which are the focus of this
effort. The results from the last year (year 200 of each reanalysis spinup) are compared with in
situ data and with one another.
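
The 10-year drift diagnostic used to judge the spinup can be written compactly. The sketch below assumes a list of global annual mean surface pCO2 values indexed by model year; the function name and the synthetic series are hypothetical and serve only to show the bookkeeping, not to reproduce the values in Figure 3.

```python
import math


def decadal_drift(annual_mean_pco2, end_year):
    """Return pCO2(end_year) - pCO2(end_year - 9), with years counted from 1.

    For end_year=10 this is (year 10 - year 1), matching the convention in the text.
    """
    if end_year < 10 or end_year > len(annual_mean_pco2):
        raise ValueError("end_year must lie between 10 and the length of the series")
    return annual_mean_pco2[end_year - 1] - annual_mean_pco2[end_year - 10]


# Synthetic, made-up series that relaxes toward equilibrium (not model output).
series = [360.0 + 5.0 * math.exp(-year / 50.0) for year in range(1, 201)]
drifts = {year: decadal_drift(series, year) for year in (10, 100, 200)}
```

A drift whose magnitude shrinks from one checkpoint to the next is the behavior the text relies on when it calls the 200-year spinup satisfactory for surface pCO2.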

2.2 Data Sets

2.2.1 Forcing Data 

Forcing data variables are shown in Figure 1. Monthly climatologies are used in all cases. All are
obtained from reanalysis products except soil dust (iron), ozone, clouds, and atmospheric CO2. Iron
is derived from soil dust deposition estimates from the Goddard Chemistry Aerosol Radiation and
Transport model (Ginoux et al., 2001). Ozone is obtained from the Total Ozone Mapping
Spectrometer and Ozone Monitoring Instrument, and cloud information (specifically cloud cover
and liquid water path) is obtained from the International Satellite Cloud Climatology Project.
Atmospheric CO2 is from the Lamont-Doherty Earth Observatory (LDEO) data set (Takahashi et
al., 2009), using a mean over the entire range of observations of 358.7 µatm. Although the ocean
pCO2 observations are nominally normalized to the year 2000 (Takahashi et al., 2009), we keep the
uncorrected mean atmospheric value from the data to represent variability at the time and location
of measurement. However, tests using year 2000-normalized atmospheric pCO2 and MERRA
forcing showed a difference in air-sea fluxes of only 0.034 mol C m⁻² y⁻¹, or about 10.3%. This
produced a slightly worse comparison with in situ estimates (7.8% as compared to -2.3%), but for
the present purposes consistent atmospheric pCO2 is the important consideration.

2.2.2 Comparison Data 

The main output of interest in this effort is the flux of CO2 (FCO2, notation following Arrigo et
al., 2010; Doney et al., 2009), representing the exchange of carbon between the atmosphere and
ocean. Positive air-sea flux is defined here as upward, indicating a source to the atmosphere.
Additionally we compare with global observations of ocean partial pressure of carbon dioxide
(pCO2). Both FCO2 and pCO2 data sets are obtained as gridded data sets on a 5° longitude by 4°
latitude horizontal grid and are surface only. They are obtained from the Lamont-Doherty Earth
Observatory (LDEO)
(http://cdiac.ornl.gov/oceans/LDEO_Underway_Database/index.html; Takahashi et al., 2009).
The FCO2 estimates are derived from 1) the ocean pCO2 data using atmospheric pCO2 to
compute ΔpCO2, which is then normalized to the year 2000, 2) wind speeds from NCEP2, and 3)
an estimate of the gas transfer coefficient (see Takahashi et al., 2009).

In addition to the gridded data sets available from LDEO, shipboard underway pCO2 data at
the location of data measurement, ungridded, with temporal sampling identified, with sampling
gaps preserved, and inclusive of all years sampled, are available
(http://cdiac.ornl.gov/ftp/oceans/LDEO_Database/Version_2009/). Using these raw observations
we can reconstruct the representation of pCO2 data at our model grid. By sub-sampling the
model by the data locations, we can remove the mismatches due to data scaling, and produce a
less biased, one-to-one comparison. We use these to compare with co-located, coincident
estimates of pCO2 from the MERRA model forcing version to understand the effects of gridding
and sampling on the global gridded representations of pCO2.
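
The sub-sampling step described above amounts to looking up the model value in the grid cell and month of each point observation. The following is a minimal sketch under stated assumptions: the grid spacing and southern boundary follow the model description, while the longitude origin (-180°), the dictionary layout of the model field, and all function names are hypothetical choices made for illustration.

```python
def grid_index(lon, lat, lon0=-180.0, lat0=-84.0, dlon=1.25, dlat=2.0 / 3.0):
    """Map a longitude/latitude pair to (i, j) indices on a regular model grid.

    lon0 is an assumed grid origin; dlon, dlat, and lat0 follow the text.
    """
    i = int(((lon - lon0) % 360.0) // dlon)
    j = int((lat - lat0) // dlat)
    return i, j


def colocate(observations, model):
    """Pair each observation with the model value in the same cell and month.

    observations: iterable of (lon, lat, month, pco2_observed)
    model: dict mapping (month, j, i) to the model pCO2 for that cell
    Observations falling in cells absent from the model are skipped.
    """
    pairs = []
    for lon, lat, month, observed in observations:
        i, j = grid_index(lon, lat)
        key = (month, j, i)
        if key in model:
            pairs.append((observed, model[key]))
    return pairs


def mean_difference(pairs):
    """Mean of (model - observation) over the co-located pairs."""
    return sum(m - o for o, m in pairs) / len(pairs)
```

Because only sampled cells and months enter the comparison, the resulting model-data difference carries the same sampling pattern on both sides.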

Carbon flux estimates are not available in the ungridded data from LDEO, but we can estimate
them from pCO2 and climatological ocean and atmospheric variables using the OCMIP
protocols, similar to the way FCO2 is computed by the model. The required variables are wind
speed, sea level pressure, and atmospheric pCO2. While all of these are derived from or force the
model in the model derivation of FCO2, we use data climatologies here to estimate FCO2 from
the LDEO pCO2 point measurement data. The data are taken from LDEO to retain as much
consistency as possible.
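
A bulk flux estimate of this kind is usually written as the product of a gas transfer velocity, the CO2 solubility, and the ocean-atmosphere pCO2 difference. The sketch below is an assumption-laden illustration, not the OCMIP protocol or the NOBM code: it uses the Wanninkhof (1992) quadratic wind dependence and Schmidt number polynomial, the Weiss (1974) solubility fit, a default coefficient of 0.31 (the value Wanninkhof gives for short-term winds, which may differ from the one used in the paper), and it omits the sea level pressure correction mentioned in the text. All function names are hypothetical.

```python
import math


def schmidt_co2(sst_c):
    """Schmidt number of CO2 in seawater (Wanninkhof, 1992 polynomial); SST in deg C."""
    return 2073.1 - 125.62 * sst_c + 3.6276 * sst_c ** 2 - 0.043219 * sst_c ** 3


def solubility_co2(sst_c, salinity):
    """CO2 solubility K0 in mol L^-1 atm^-1 (Weiss, 1974 fit)."""
    t = (sst_c + 273.15) / 100.0
    return math.exp(
        -58.0931 + 90.5069 / t + 22.2940 * math.log(t)
        + salinity * (0.027766 - 0.025888 * t + 0.0050578 * t * t)
    )


def air_sea_flux(pco2_ocean, pco2_atm, wind, sst_c, salinity,
                 ice_fraction=0.0, coeff=0.31):
    """Air-sea CO2 flux in mol C m^-2 y^-1, positive upward (ocean to atmosphere).

    pCO2 values in uatm, wind speed in m s^-1. The pressure correction is omitted
    and coeff is an assumed default; both are simplifications for illustration.
    """
    k_cm_per_hr = coeff * wind ** 2 * math.sqrt(660.0 / schmidt_co2(sst_c))
    k_m_per_yr = k_cm_per_hr * 0.01 * 24.0 * 365.0
    k0 = solubility_co2(sst_c, salinity) * 1.0e-3  # mol m^-3 uatm^-1
    return (1.0 - ice_fraction) * k_m_per_yr * k0 * (pco2_ocean - pco2_atm)
```

The sign convention matches the text: ocean pCO2 above the atmospheric value gives a positive (upward) flux, and the quadratic wind term is why modest differences in reanalysis wind speed can change the flux appreciably.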

2.2.3 Evaluation 

Results are evaluated globally and regionally in 12 major oceanographic basins (Figure 4) from
the forcing by each of the four reanalysis products. Comparisons are statistical, including
differences between model global and regional means and correlation analysis. Our emphasis is
on large temporal and spatial scale results, using annual area-weighted means and correlation
analysis across the basins (N=12, with 10 degrees of freedom). We additionally compare model
pCO2 and FCO2 from one of the reanalyses, MERRA, against in situ data sub-regionally to
estimate the influences of inherent model biases on the results shown in the intercomparison of
reanalysis products.
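
The evaluation statistics can be sketched as follows. This assumes a regular latitude-longitude grid, so that the cosine of latitude is an adequate proxy for cell area, and a two-tailed test at P<0.05, for which the critical t value with 10 degrees of freedom is 2.228; whether the paper used a one- or two-tailed test is not stated, so that choice is an assumption. Function names are hypothetical.

```python
import math


def area_weighted_mean(values, lats_deg):
    """Mean of grid-cell values weighted by cos(latitude) as a proxy for cell area."""
    weights = [math.cos(math.radians(lat)) for lat in lats_deg]
    return sum(w * v for w, v in zip(weights, values)) / sum(weights)


def pearson_r(x, y):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


def is_significant(r, n, t_crit=2.228):
    """Test r against a critical t value; the default is two-tailed P=0.05, df=10."""
    if abs(r) >= 1.0:
        return True
    df = n - 2
    t_stat = abs(r) * math.sqrt(df / (1.0 - r * r))
    return t_stat > t_crit
```

With N=12 basins, a correlation of about 0.58 or higher passes this threshold, which is why coefficients in the 0.7 to 0.8 range are reported as significant.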


3.0 Results 

Global annual mean FCO2 estimates from the model forced by the four different reanalysis products
show considerable spatial similarity (Figure 5). The difference between the lowest estimate,
NCEP2 (-0.276 mol C m⁻² y⁻¹), and the highest, ECMWF (-0.402 mol C m⁻² y⁻¹), is about 0.13 mol
C m⁻² y⁻¹, or about 45%. MERRA forcing is closest to in situ estimates (within 0.008 mol C m⁻²
y⁻¹, or 2%), with NCEP1 only slightly more distant (by 0.024 mol C m⁻² y⁻¹, or 7.0%).
Correlations with in situ estimates across basins are positive and statistically significant (P<0.05)
for all forcings, with correlation coefficients ranging from 0.73 (MERRA and ECMWF) to 0.80
(NCEP1).

There are, however, substantial differences in basin-scale estimates of FCO2 among the various
reanalysis forcings, especially in the high latitudes and tropics (Figure 5). In the high latitudes (>
±40° latitude), all the forcings produce strong sinks in the oceans, in accordance with the in situ
estimates, but all are weaker than the data. The NCEP2 sink in the Antarctic is the lowest (-0.97
mol C m⁻² y⁻¹), representing only about a third the magnitude of the next smallest sink
(ECMWF). NCEP2 forcing produces an intensification and expansion of local source regions
near 60°S latitude in the Antarctic (Figure 6). This counters the amplification of the sink regions
just to the north. MERRA forcing produces the smallest sink in the North Pacific and North
Atlantic basins (Figure 5). The weaker sink in the North Pacific can be attributed to a source
region east of the Sea of Okhotsk (Figure 6), and that in the North Atlantic to a local source in the
Labrador Sea. MERRA-estimated fluxes in these two basins are about 0.15 mol C m⁻² y⁻¹ (39%)
lower in the North Pacific than the strongest sink and 0.33 mol C m⁻² y⁻¹ (21%) lower in the
North Atlantic. The strongest sink in both cases is produced by NCEP2.

In the tropical basins, the estimates of air-sea carbon fluxes by NCEP2 produce the strongest
source in 3 of the 4 major basins (Figure 5). Sometimes this is closer to the in situ estimates
relative to the other forcings, as in the Equatorial Atlantic, and sometimes it is a larger departure,
as in the Equatorial Indian. The large source represented by NCEP2 forcing in the Equatorial
Pacific is derived from a very strong local flux along the Peru coast (Figure 6). Although a
smaller manifestation appears in NCEP1 and ECMWF forcing, it does not appear in MERRA
forcing, which leads to its representation of the smallest Equatorial Pacific source. ECMWF
departs strongly from the other forcings in the North Indian, with fluxes nearly 3 times those
estimated by the lowest reanalysis (NCEP1), but is closer to the in situ estimates (Figure 5).
This stronger source can be attributed to local intensification offshore of Somalia (Figure 6),
a feature that is either much smaller in the other forcings (NCEP1) or non-existent (MERRA and
NCEP2).

Estimates of FCO2 in the sub-polar basins are more similar among the forcings than in the high
latitudes and tropics (Figure 5), exhibiting the lowest ranges of estimates of all the basins.
ECMWF is the strongest sink in 4 of the 5 basins, while MERRA forcing is the lowest in 2
basins (North Central Pacific and Atlantic). All the forcings indicate a much stronger sink
estimate in the South Atlantic and Pacific than the in situ estimates.

Global area-weighted mean partial pressures show similar relationships among the four
reanalysis forcings and with the data (Figure 7). The deviations from data are much smaller than
for the flux estimates: all are within 1% of data global means, with ECMWF the outlier at 0.6%.
NCEP1 pCO2 is closest to the data, with a difference <1 µatm, or -0.1%. All forcings also show
positive and statistically significant correlations across basins, with values similar to the fluxes.

On basin scales the pCO2 mean differences between the forcings and data are smaller, and
more consistent with one another, than for the basin fluxes (Figure 7). The South Atlantic is a
notable exception, which exhibits a departure from the data for all forcings similar to the fluxes.
NCEP2 forcing is noticeably closer to the data pCO2 but it is still low by 26 µatm (about 7%).
Modeled North Atlantic pCO2 estimates are high compared to data (maximum of 12 µatm, or
3.5%, by ECMWF). The Equatorial Atlantic estimates are consistent with data (Figure 7), in
contrast to the fluxes (Figure 5).

Spatial distributions of pCO2 from the different forcings generally show patterns similar to the
air-sea fluxes, but the contrast between highs and lows is reduced (Figure 8). ECMWF has the
lowest pCO2 in the southern 60° band where the fluxes are large and positive, but otherwise the
features are comparable.




Selected variables from the reanalyses particularly relevant to ocean carbon surface fluxes
include ice concentrations, SST, and wind speed, and are shown in Figure 9. Differences in
these reanalysis variables in the high latitude basins suggest some reasons for the differences in
air-sea flux observed in the biogeochemical model (Figure 5). Ice concentrations are similar for
all four reanalysis estimates in the North Pacific and Antarctic, but there are some apparent
differences in the North Atlantic. There are considerable differences in SST and wind speed
among the four reanalyses for all the high latitude basins.

For the tropical basins, only SST and wind speed are shown, and there are considerable
differences in the variables among the four reanalysis products (Figure 10). NCEP2 is
consistently warmer than the other reanalyses, more than 1°C above the lowest estimate in 3 of
the 4 basins, and nearly 1°C in the North Indian. Additionally, NCEP2 always exhibits the
highest annual mean wind speeds, occasionally rising to nearly 1 m s⁻¹ higher than the others. At
the other extreme, MERRA and NCEP1 have nearly identical annual mean SST and wind speeds
in all the tropical basins. ECMWF and NCEP1 have nearly identical SST in the Equatorial
Indian, Pacific, and Atlantic.

In addition to the full global representations of the model and the in situ pCO2 gridded,
re-sampled, and interpolated climatology from LDEO, we provide the non-interpolated point
measurements and the corresponding model with the sampling biases of the data in time and
space removed (Figure 11). This provides a more realistic comparison of the model and data to
enable improved evaluation of model issues. A difference map (Figure 12) provides an
enhancement of the comparison. A side-by-side comparison of pCO2, both with data sampling
biases and without, completes the comparison (Figure 13).

4.0 Discussion 

4.1 Reanalysis-Forcing Model Results 

Global annual mean air-sea carbon fluxes and pCO2 are largely independent of the choice of
reanalysis forcing (Figures 5 and 7). The flux estimates are similar, the sign of the fluxes (source
or sink) by basin are identical, and correlations with in situ estimates across major oceanographic
basins are positive and statistically significant (P<0.05) regardless of the reanalysis forcing used.
Correlations for pCO2 are similarly positive and significant. The maximum variability in fluxes
is about ±20%, which suggests the magnitude of uncertainty in ocean carbon models due to
choice of reanalyses.

That global air-sea carbon fluxes and pCO2 are generally similar regardless of reanalysis
forcing is reassuring. It suggests that at the largest spatial scales, state-of-the-art representations
of physical processes and assimilation approaches embedded in the reanalysis methods, while
quite different among the different reanalyses, produce consistent results. In essence, this means
that important variables used for ocean carbon model forcing are similar on global scales, and
that whatever important differences there are among the four reanalysis products, global ocean
carbon mean fluxes and pCO2 are insensitive to them. This is less sweeping when one considers
that only a portion of the vast reanalysis variables produced are important in ocean carbon
modeling, the most important of which are surface temperature, wind speeds and stresses, and
ice distributions, and when the sensitivities of ocean carbon models are determined by complex
interactions in the model formulations.

Although the global carbon flux and pCO2 distributions are similar among reanalyses, there are
considerable differences on oceanographic basin scales. Air-sea carbon fluxes, as small
differences between large values of atmospheric and ocean pCO2, are especially sensitive to
small variations in the representation of atmospheric forcing by reanalysis products. None of the
reanalysis products is uniformly superior in all basins, nor is any uniformly inferior, as
compared to in situ estimates. The differences among the reanalyses are largest in the high
latitudes and the tropics, which incidentally represent the basins of strongest sinks and strongest
sources, respectively.

Few of the major departures observed in MERRA forcing, such as the South Atlantic and
Pacific, North Indian, North Central Pacific, and North Pacific, are rectified by the other
reanalysis products (Figure 5). ECMWF forcing, however, substantially ameliorates the
departures observed in the MERRA and NCEP forcings in the North Indian and the Equatorial
Pacific.

Attribution of the differences of air-sea fluxes to specific variables in the reanalysis products is
difficult because of the complexity of the ocean carbon cycle. Additionally, differences in annual
mean fluxes shown here can be the result of seasonal differences in reanalysis products. A complete
analysis of the effects of the reanalysis products and their influences on the representation of the
global ocean carbon cycle is beyond the scope of this paper. However, it is worthwhile to attempt
to relate differences in forcing with differences in fluxes, at least at coarse basin and annual scales,
to assist in understanding how the reanalysis variables are affecting the observed changes in the
representation of the global ocean carbon cycle.

First, we note that there are only 6 reanalysis variables affecting the air-sea fluxes in this
biogeochemical model: ice concentrations, SST, surface pressure, wind speeds, and the x and y
components of wind stress (Figure 1). We can partially separate these into 1) those that affect
the circulation, and therefore affect the biology, chemistry, and physical transfers of carbon in
the water column, producing the surface distributions where the exchange with the
atmosphere occurs (SST and wind stress), and 2) those that directly affect the exchange of carbon
between the ocean and atmosphere (sea ice, wind speeds, and pressure). Here we restrict our
investigation to SST, sea ice, and wind speeds. Pressure plays a modest role in the air-sea flux
and the differences among the reanalysis products are relatively small. Wind stresses are critical
drivers of the circulation patterns and vertical processes, but they operate in complex ways and
much of their influence is reflected in the SST.

Beginning with the high latitudes, the Antarctic basin exhibits a very large range of estimated
fluxes from the different reanalysis products (Figure 5), with NCEP2 producing a much lower
sink than the other reanalyses. The NCEP2 reanalysis coincidentally has the highest SST (>1°C
higher than the lowest from ECMWF), and the highest wind speeds (1.4 m s⁻¹ higher than the
lowest, represented by NCEP1), as seen in Figure 6. The higher temperature from NCEP2
coupled with stronger winds is consistent with stronger outgassing of CO2 in the Antarctic,
which would produce a reduced basin scale sink, as observed here.

In the northern high latitudes, MERRA forcing produces the weakest sinks, which correspond
with relatively low wind speeds (Figure 9). MERRA winds are >1 m s⁻¹ lower than the highest
winds in both the North Pacific and North Atlantic. These low winds in MERRA are consistent
with reduced exchange of pCO2 with the atmosphere and result in reduced sinks of atmospheric
carbon. The relatively high SST of MERRA may also play a role in weakening the North
Atlantic fluxes. Similarly, we note that the strongest sinks in the North Atlantic are produced by
NCEP2 and NCEP1. NCEP2 has the strongest winds, while NCEP1 has the lowest SSTs.

The tropical basins produce the largest range in air-sea carbon fluxes among the 4 reanalysis
products (Figures 5 and 6). The most notable divergences are NCEP2 (strongest source) and
MERRA (weakest source) in the Equatorial Pacific. NCEP2 SST and wind speeds are both the
largest of the reanalyses (Figure 10). NCEP2 SST is >1°C higher than the lowest (ECMWF,
although NCEP1 and MERRA are consistent to within 0.03°C), and NCEP2 wind speed is 0.9 m
s⁻¹ higher than the lowest, represented by NCEP1. These high SSTs and wind speeds can be
associated with stronger outgassing as observed in the fluxes. The converse is true as well:
NCEP1's and MERRA's weaker winds produce lower fluxes, despite higher pCO2 than the data
(Figure 7). A similar series of observations occurs in the Equatorial Atlantic, where NCEP2's
stronger representation of a source to the atmosphere (Figure 5) is associated with the highest
SST and wind speed (Figure 10). The weakest source produced by MERRA forcing in these two
basins is more difficult to attribute to MERRA's representation of SST and wind speed.

4.2 Inherent Model/Data Issues

Although not the primary focus of this effort, the intercomparison of simulated fluxes and
pCO2 from four different reanalysis products provides an opportunity to gain insights into
inherent model and data ocean carbon issues. First we note that the reanalysis products are
largely not capable of rectifying the major discrepancies between the model and data. Second
we note that as we descend from coarser to finer resolution, the issues become more important.

For both air-sea fluxes and pCO2, global model agreement with in situ data is strong, with
maximum deviations of 19% for FCO2 and 0.6% for pCO2 among all the reanalysis forcing
products (Figures 5 and 7). Deviations for pCO2 are much smaller than for fluxes. Basin
correlations are statistically significant at P<0.05 for all forcings for both FCO2 and pCO2, and
correlation coefficients range from 0.73 to 0.80.

On regional scales, more model-data deviations are apparent and they can be large at times.
We note particularly the South Atlantic and to a lesser extent the North Atlantic (Figures 5 and
7). For air-sea fluxes, additional problems are seen in the Pacific basins (except the Equatorial
Pacific) and the Equatorial Atlantic. pCO2 estimates exhibit much smaller discrepancies in the
above basins but not in the North and South Atlantic (Figure 7). Since the results from the
different forcings only partially alleviate the model-data differences, we suggest that here the
problems arise in the model formulation and/or the comparison with in situ data.

On smaller scales the discrepancies between model and data are larger still (Figures 11 and 12).
For the full model domain and interpolated in situ climatology (top panels in Figure 11),
noteworthy deviations are the high source regions in the model in the Southern Ocean along the
60°S band, high sources along the US/Canada East and West coasts in the North Atlantic and
Pacific, and model sinks in the southern sub-tropical Atlantic and Pacific.

The 60°S Southern Ocean band of high atmospheric source is common to all the reanalysis
versions, and the discrepancy is partially the result of sampling biases in the in situ data. Public
data sets of pCO2 and FCO2 (Takahashi et al., 2009) are taken from point measurements in the
ocean, gridded to 5° longitude by 4° latitude, binned to an annual mean climatology, and with
residual gaps filled. Each of these steps potentially introduces a bias in the final result, and is
especially important when comparing to model annual means, which have no sampling issues.
Binning to a coarse grid reduces variability and over-represents the influences of observation
points closest to gaps. Constructing annual means where data exist for only a few months
creates an unbalanced representation, with the sampled months over-represented. If the sampled
months occur at a low or high point in the seasonal cycle, the problem is exacerbated. Filling
gaps, like binning to a coarse grid, over-represents the influence of observations nearest the gaps
unless methods are actively used to reduce this problem. Takahashi et al. (2009) used an
interpolation scheme based on assumed advective transport.
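
The effect of building an annual mean from only a few sampled months can be illustrated with a toy seasonal cycle. Everything in the sketch below is synthetic and hypothetical (an idealized sinusoidal cycle with made-up mean, amplitude, and peak month); it is meant only to show the direction of the bias, not to represent any observed pCO2 record.

```python
import math


def monthly_cycle(mean, amplitude, peak_month):
    """Idealized sinusoidal seasonal cycle, returned as 12 monthly values (Jan..Dec)."""
    return [
        mean + amplitude * math.cos(2.0 * math.pi * (month - peak_month) / 12.0)
        for month in range(1, 13)
    ]


def annual_mean(monthly, sampled_months=None):
    """Average over all 12 months, or only over the sampled months (1-12) if given."""
    months = range(1, 13) if sampled_months is None else sampled_months
    values = [monthly[month - 1] for month in months]
    return sum(values) / len(values)


# Synthetic example: mean 360 uatm, 20 uatm seasonal amplitude, peak in August.
cycle = monthly_cycle(360.0, 20.0, 8)
full_mean = annual_mean(cycle)                  # all months represented equally
peak_only_mean = annual_mean(cycle, [7, 8, 9])  # only months near the seasonal peak
```

When the sampled months cluster near the seasonal maximum, the partial mean sits well above the true annual mean, which is the imbalance the paragraph above describes. Sub-sampling the model to the same months places both sides of the comparison under the same bias.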

When we sub-sample the model to match the point measurement locations and months 
observed, and construct a model representation corresponding in time and space to the 
data, we see that the areas of high sources along 60°S are considerably reduced in intensity and 
extent (Figure 11). The localized high source region from longitudes 20°E to 75°E nearly 
disappears. This reduction and disappearance do not mean that the model agrees with the 
data. We note that there is some evidence of outgassing in the data in this region, such as the 
portion just north and slightly west of the Ross Sea, and in the central Atlantic sector. However, 
the residual disagreement between the sub-sampled model and data points to model issues. The 
outgassing in the model, and to a lesser extent the data, is intensified in austral autumn and 
winter. This corresponds with high pCO2 (data not shown), resulting from convection of deep 
DIC and low ocean temperatures. The model is not capable of sequestering carbon taken up and 
sunk by biological processes in austral summer deep enough to avoid return to the surface in 
local winter. We note that other models exhibit outgassing along this 60°S band as well (e.g., 
Doney et al., 2009), but it is admittedly less intense and less widespread than seen here. 
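
The sub-sampling step described above can be sketched as follows. This is an illustration only: the data layout (dictionaries keyed by month and cell), the function name, and the values are assumptions, not the procedure actually used to produce Figure 11.

```python
def subsampled_annual_mean(model, observed):
    """Annual mean of `model` using only the (month, cell) pairs in `observed`.

    model:    dict mapping (month, cell) -> model value for every month and cell
    observed: set of (month, cell) pairs where an in situ measurement exists
    Returns a dict mapping cell -> mean over that cell's observed months.
    Cells with no observations are omitted rather than gap-filled.
    """
    sums, counts = {}, {}
    for (month, cell) in observed:
        sums[cell] = sums.get(cell, 0.0) + model[(month, cell)]
        counts[cell] = counts.get(cell, 0) + 1
    return {cell: sums[cell] / counts[cell] for cell in sums}

# Hypothetical model output: cell "A" outgasses only in months 5-8,
# cell "B" has a constant flux.
model = {(m, "A"): (3.0 if 5 <= m <= 8 else 0.0) for m in range(12)}
model.update({(m, "B"): 1.0 for m in range(12)})

# Hypothetical observation coverage: "A" sampled in two summer months only,
# "B" sampled once.
observed = {(0, "A"), (1, "A"), (6, "B")}

result = subsampled_annual_mean(model, observed)
full_A = sum(model[(m, "A")] for m in range(12)) / 12
print(result, full_A)
```

Here the full model annual mean at cell "A" is 1.0, while the sub-sampled mean is 0.0 because the outgassing months were never observed. Comparing the sub-sampled model with the data therefore removes the part of the mismatch that is due to sampling alone.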

A similar explanation helps explain the discrepancies between the model and data in the South 
Atlantic. Poor sampling produces a distorted view of the model-data comparison in the 
interpolated representations. In the sub-sampled model, the correspondence is improved (Figure 
11), although there are mismatches along two north-south lines toward the eastern portion of the 
basin. In fact, the basin mean model-data flux difference here falls from -1.17 mol m⁻² y⁻¹ in the 
full interpolated data and model to -0.18 mol m⁻² y⁻¹ in the sub-sampled representation. 

Data sampling issues also contribute to the discrepancies in the South Pacific. Here the basin 
mean model-data flux bias is -0.45 mol m⁻² y⁻¹ for the interpolated comparison (Figure 5). When 
the sampling biases are removed, the difference is nearly half, at 0.27 mol m⁻² y⁻¹. 

Model-data biases in the North Atlantic and Pacific are more complicated. Some of the 
difference is due to data sampling, as the LDEO data are missing in the northern Labrador Sea 
and the Sea of Okhotsk. But otherwise data sampling in these two basins is relatively complete 
spatially and temporally. The near-coastal source regions in the model near the US/Canada 
borders are in contrast to the data and suggest model formulation issues. Since the discrepancies 
appear in all the reanalysis versions (although variable), they are apparently not due to 
differences in forcing. The possibility of issues in winds, SST, and ice common across all the 
reanalysis products cannot be ignored, but we ascribe the discrepancies to 1) excessive upwelling 
coupled with excessive deep carbon in the model, 2) locally high atmospheric pCO2 from fossil 
fuel burning that is not represented in the global mean value used, and 3) inadequate uptake and 
sequestration of carbon by biological processes. 

The Equatorial Atlantic also exhibits large model-data discrepancies in fluxes (Figure 5). This 
is one of the most perplexing basins, since the model pCO2 results, by all the forcings, are 
consistent with data: ECMWF and MERRA are within 5 µatm (1.2%) while the two NCEP 
forcings are within 1 µatm (0.2%) (Figure 7). Fluxes are a non-linear function of pCO2 (actually 
delta pCO2), with functions involving wind speed and temperature contributing to the 
non-linearity (Wanninkhof, 1992). Small differences in these variables may produce large changes in 
the fluxes. It is important to remember that the LDEO air-sea fluxes are estimates derived from 
observed ΔpCO2 and estimated wind speeds, along with a gas transfer coefficient (Takahashi et 
al., 2009). Groger and Mikolajewicz (2011) have suggested that the Schmidt number for flux 
estimates (involved in the gas transfer coefficient) could have issues at temperatures >30°C, but 
neither the sea surface temperature climatologies used by LDEO (from Conkright et al., 2001) nor 
the SST climatologies in our reanalysis data ever exceed this threshold in the Equatorial Atlantic. 
Additionally, our use of this parameter is the same as for the in situ estimates (Takahashi et al., 
2009). As with several other basins, when we account for sampling, the disparity in fluxes is 
much smaller. The in situ flux estimates decline by nearly half, from 0.63 to 0.33 mol C m⁻² y⁻¹. 
This produces in situ flux estimates similar to the NCEP2 fluxes shown in Figure 5. 
MERRA-forced model fluxes sampled to the in situ estimates (Figure 11) decline only about 0.07 mol C 
m⁻² y⁻¹, so they remain essentially the same as shown in Figure 5 for this basin. This means that 
when sampling biases are removed, the difference between MERRA-estimated fluxes and in situ 
estimates is about the same as the difference between the model forced by MERRA and by 
NCEP2. Residual differences are likely due to wind speed resolution differences (we interpolate 
reanalysis data to the native model grid, 1.25° longitude by 0.67° latitude, compared to the 
NCEP2 reanalysis re-gridded to 5° longitude by 4° latitude resolution by LDEO). When we 
interpolate our NCEP2 wind speed reanalysis data to the LDEO resolution, we find a mean 
increase of 1.86 m s⁻¹ in the Equatorial Atlantic, which would lead to enhanced 
atmosphere-ocean carbon exchange. Re-gridding can be sensitive to data frequency distributions, especially 
in small basins such as this one. It can also increase the influence of values over land, which 
may affect the representation of the mean wind speeds. 
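
The sensitivity of fluxes to how wind speeds are averaged can be illustrated with a minimal sketch of a quadratic gas transfer relationship of the form given by Wanninkhof (1992). The coefficient of 0.31 is the short-term wind value from that paper and is used here only as an assumption; the coefficient actually used by the model and by LDEO is not restated in this section, and the wind values are hypothetical.

```python
def gas_transfer_velocity(u10, schmidt=660.0, coeff=0.31):
    """Quadratic wind-speed relationship: k = coeff * u10^2 * (Sc/660)^-0.5 (cm/h).

    u10 is wind speed in m/s; schmidt is the Schmidt number (660 assumed by
    default); coeff is an assumed coefficient, not necessarily the model's.
    """
    return coeff * u10 ** 2 * (schmidt / 660.0) ** -0.5

# Hypothetical fine-grid wind speeds (m/s) that fall inside one coarse cell.
fine_winds = [3.0, 5.0, 9.0, 11.0]
mean_wind = sum(fine_winds) / len(fine_winds)

# Transfer velocity computed from the coarse-cell mean wind.
k_from_mean_wind = gas_transfer_velocity(mean_wind)

# Mean of transfer velocities computed at the fine resolution.
k_mean_of_fine = sum(gas_transfer_velocity(u) for u in fine_winds) / len(fine_winds)

print(f"k from mean wind: {k_from_mean_wind:.2f} cm/h")
print(f"mean of fine-grid k: {k_mean_of_fine:.2f} cm/h")
```

Because the relationship is quadratic, the two estimates differ by about 20% in this example even though the mean wind is identical. For a fixed solubility and delta pCO2 the flux scales with the transfer velocity, so resolution and re-gridding choices for wind speed can change basin fluxes without any change in pCO2.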


5.0 Summary and Conclusions 

Intercomparison of air-sea carbon fluxes and pCO2 spatial distributions using a single ocean 
model and four different reanalysis products shows that global means are insensitive to the 
choice of reanalysis product. This suggests that at least for the variables most important for 
ocean carbon exchange, i.e., wind speeds, SST, and ice, the reanalysis products are either in 
general agreement, or that the differences among them are relatively unimportant at the largest 
spatial scales. This finding is emphatically not true for regional analyses, where large 
differences in FCO2 are observed depending upon the reanalysis product used for forcing. pCO2 
distributions are considerably less sensitive to the choice of reanalysis product. These findings 
have important implications for ocean modelers in choosing reanalysis products: namely that for 
global models it does not matter much, but for regional and local models the selection can have 
important influences on carbon cycling and exchange estimates. 

The finding that different estimates of air-sea fluxes are produced by different reanalyses at 
regional scales reinforces the work by Otero et al. (2013), who used different reanalysis sources 
in the Bay of Biscay. Several other ocean carbon modeling efforts have utilized versions of 
NCEP forcing data (e.g., Le Quéré et al., 2010; Doney et al., 2009; McKinley et al., 2004). 

This effort provides a milepost for evaluating the use of different reanalysis forcing products 
for ocean carbon models, at least in a general sense. The overarching conclusion, i.e., that global 
estimates of carbon fluxes and pCO2 are insensitive to the choice of forcing, is likely robust. 
Similarly, the other conclusion, that regionally and sub-regionally the choice of reanalysis has 
successively more influence, is likely to apply to other models as well. However, the nature 
of the differences and sensitivities is likely to be different. The difference will be dependent 
upon the nature of the model formulation, but we hope the results provided here will be of help 
in the selection and use of reanalysis products for global and regional ocean carbon models. 


Figure Captions 


Figure 1. Interactions among the main components of NOBM, nominal outputs, and forcing 
fields. IOP indicates inherent optical properties. Forcing variables are shown in the gray boxes. 
Reanalysis forcing variables are in bold. Surface pressure and precipitable water effects on 
surface irradiance play a small role in the inorganic carbon results and are ignored in this effort. 

Figure 2. Pathways and interactions among the components of the biogeochemical processes 
model, illustrating the interactions with the carbon cycle, comprising dissolved inorganic carbon 
(DIC), dissolved organic carbon, and exchanges with the atmosphere as a function of the ocean 
and atmosphere partial pressures of CO2 (pCO2). The biological pump is represented by 
phytoplankton, herbivores, nutrients, and detritus. Color reproduction requested. 

Figure 3. Evolution of pCO2, silicon, and nitrate global surface means over the 200-year spinup. 

Figure 4. Delineation of the 12 major oceanographic basins. 

Figure 5. Global and basin annual mean air-sea carbon fluxes (FCO2). The basins are arranged 
by high latitudes (top right), tropical (bottom left), and sub-polar (bottom right). Correlation 
coefficients compared with in situ estimates are shown in the global plot, where an asterisk 
indicates P<0.05. Color reproduction requested. 

Figure 6. Annual mean air-sea carbon fluxes (FCO2) from the model using forcing fields from 
different reanalysis products. Color reproduction requested. 

Figure 7. Global and basin annual mean pCO2. The basins are arranged by high latitudes (top 
right), tropical (bottom left), and sub-polar (bottom right). Correlation coefficients compared 
with in situ estimates are shown in the global plot, where an asterisk indicates P<0.05. Color 
reproduction requested. 

Figure 8. Annual mean pCO2 from the model using forcing fields from different reanalysis 
products. Color reproduction requested. 


Figure 9. Annual mean ice concentration (percent), SST (degrees C), and wind speeds (m s⁻¹) for 
the 4 reanalysis products in the high latitude basins. Color reproduction requested. 

Figure 10. Annual mean SST (degrees C - 20) and wind speeds (m s⁻¹) for the 4 reanalysis 
products in the tropical basins. Note that 20° is subtracted from SST to fit on the same scale as 
wind speed. Color reproduction requested. 

Figure 11. Model and in situ FCO2 (top), and sub-sampled model corresponding to point 
measurements in situ FCO2 (bottom). The model is represented by the MERRA-forced version. 
The model is sub-sampled by month and location to the in situ data to produce a comparable 
annual mean. 

Figure 12. Difference between the model forced by the four different reanalysis products and in situ 
FCO2. 


Figure 13. Model and in situ gridded pCO2 (top), and sub-sampled model corresponding to point 
measurements in situ pCO2 (bottom). The model is represented by the MERRA-forced version. 
The model is sub-sampled by month and location to the in situ data to produce a comparable 
annual mean. 